The extension of DNA confined to nanochannels has been studied intensively and in detail. Yet
quantitative comparisons between experiments and model calculations are difficult because most
theoretical predictions involve undetermined prefactors, and because the model parameters (contour
length, Kuhn length, effective width) are difficult to compute reliably, leading to substantial
uncertainties. Here we use a recent asymptotically exact theory for the DNA extension in the
‘extended de Gennes regime’ that allows us to compare experimental results with theory. For this
purpose we performed new experiments, measuring the mean DNA extension and its standard
deviation while varying the channel geometry, dye intercalation ratio, and ionic buffer strength. The
experimental results agree very well with theory at high ionic strengths, indicating that the model
parameters are reliable. At low ionic strengths the agreement is less good. We discuss possible
reasons. Our approach allows us, in principle, to measure the Kuhn length and effective width of a
single DNA molecule and more generally of semiflexible polymers in solution.

PACS numbers: 87.15.-v, 36.20.Ey, 87.14.gk 


Nano-confined DNA has recently been intensively
studied [1-6] as a means of stretching the molecules in
order to study local properties (e.g. DNA sequence [7, 8]).
A fundamental question is how the physical properties of 
the DNA and the solution affect the extent to which the 
molecule is stretched by confinement. Experimentally 
this question has been investigated in detail, varying the 
confinement, the length of the DNA molecule, and the 
properties of the solution (see Ref. [1] for a review). 

It is commonly assumed that DNA can be modeled as
a semiflexible polymer with hard-core repulsive
interactions [9-15]. Measurements [16-19] and theoretical
considerations [9, 16] indicate that a worm-like chain model
may be a good approximation. A recent study [3]
compares experimental results for the extension of confined
λ-DNA to results of computer simulations of a self-avoiding
discrete worm-like chain model, indicating that it may
describe the experimental results well.

Yet quantitative comparison between experiments and
theoretical model calculations has remained difficult for
at least two reasons. First, the model parameters (contour
length, Kuhn length $\ell_K$, and effective width $w_{\rm eff}$
of the semiflexible polymer) are difficult to determine
reliably: there is substantial uncertainty regarding the
physical properties of DNA. Second, the model is hard
to analyse theoretically. But a recent asymptotically exact
theory [20, 21] for the extension of a confined
self-avoiding semiflexible polymer in the so-called ‘extended
de Gennes regime’ [11, 13, 22] overcomes the second
difficulty: it makes precise predictions for the prefactors
and exponents defining scaling laws as a function of the
physical parameters, Eqs. (2) below. This opens the
possibility of determining $\ell_K$ and $w_{\rm eff}$ experimentally
from measurements of confined DNA. In this article we report on
experimental results mapping out how the extension of
confined DNA in the extended de Gennes regime depends
on channel geometry, ionic strength of the solution, and
the amount of dye bound to the molecule.

At high ionic strengths we find very good agreement
between experiment and theory using approximations for
$\ell_K$ and $w_{\rm eff}$ that are commonly employed [23-26], and
taking into account how the contour length depends on
the amount of dye molecules bound to the DNA. The
comparison between experiment and theory is so precise
that it enables us to detect subtle alignment effects [27]
at the border of the extended de Gennes regime. At low
ionic strengths the agreement is not as good. This may
indicate that theoretical estimates of $\ell_K$ and $w_{\rm eff}$ must be
improved. We expect that $\ell_K$ and $w_{\rm eff}$ can be determined
precisely from experiment by extending the approach
described in this article.

Extended de Gennes regime. Consider a semiflexible
polymer of contour length $L$, Kuhn length $\ell_K$ [9] and
excluded volume $v$ per Kuhn length segment. The excluded
volume is often written in terms of an effective width $w_{\rm eff}$,
defined by the relation $v = (\pi/2)\,\ell_K^2 w_{\rm eff}$. This expression
for the excluded volume is based on Onsager’s result [28]
for the excluded volume of a cylinder of length $\ell_K$ and
diameter $w_{\rm eff}$, which in the limit $\ell_K \gg w_{\rm eff}$ reduces to
the expression above. We phrase our results in terms of
the effective width $w_{\rm eff}$. This is customary but we stress
that, strictly speaking, it is the excluded volume that
determines the statistics of the polymer, and that even
for DNA models with only hard-core repulsion between
rod-like segments, the effective width does not equal the
actual width of the rods, except in the limit $\ell_K \gg w_{\rm eff}$.
This is important to consider when evaluating the results
of simulations.

FIG. 1: a Experiment 1. λ-DNA in a 150 nm × 108 nm channel,
different buffer concentrations, and different luminosities
corresponding to different dye loadings. Shown are representative
video frames (scale bar applies to panels b, c as well). b
Experiment 2. T4-DNA in a nanofunnel in 0.05× and 2×TBE
solution, varying funnel width $D_W$ at constant $D_H$ = 120 nm.
c Experiment 3. T4-DNA in a 302 nm × 300 nm channel,
different buffer concentrations. d Time trace of the fluorescence
intensity for λ-DNA in a 108 nm × 150 nm channel in 5×TBE
solution, center-of-mass motion subtracted.

The polymer is confined to a channel with cross section
$D_W \times D_H$. The polymer exhibits different confinement
regimes distinguished by different laws for the polymer
extension in the channel direction [21]. The extended de
Gennes regime is defined by the conditions [21]

$$ \ell_K \ll D_H , \qquad D_W^2 \ll D_H\,\ell_K^2 / w_{\rm eff} , \tag{1} $$

where we assume that $D_W \geq D_H$. For a square channel
the corresponding conditions were previously derived and
discussed in Refs. [10, 11, 13, 20, 22]. In regime (1) exact
expressions for the mean $\mu$ and the variance $\sigma^2$ of the
extension in the channel direction are known [20, 21],
provided that the contour length is long enough [20]:

$$ \mu / L = 0.9338(84)\,\left[ \ell_K w_{\rm eff} / (D_H D_W) \right]^{1/3} , \tag{2a} $$

$$ \sigma^2 / (L\,\ell_K) = 0.364(17) . \tag{2b} $$

The errors quoted for the coefficients reflect strict bounds
[20] derived from the exact results of Ref. [29]. Provided
$L$ is known, Eqs. (2) allow one to infer $\ell_K$ and $w_{\rm eff}$ from
measurements of nanoconfined DNA molecules.
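The inference step described above can be sketched in a few lines of Python. This is a minimal illustration, not the analysis code of this work. It assumes that Eq. (2a) has the form $\mu/L = 0.9338\,[\ell_K w_{\rm eff}/(D_H D_W)]^{1/3}$ and Eq. (2b) the form $\sigma^2/(L\ell_K) = 0.364$; the function names and the contour length used in the example are hypothetical.

```python
# Sketch: forward prediction and inversion of Eqs. (2).
# Assumed forms: mu/L = C_MU * (l_K * w_eff / (D_H * D_W))**(1/3),
#                sigma^2 / (L * l_K) = C_VAR.
# All lengths are in nm. Function names are illustrative only.

C_MU = 0.9338   # prefactor quoted in Eq. (2a)
C_VAR = 0.364   # prefactor quoted in Eq. (2b)


def predict_extension(L, l_K, w_eff, D_H, D_W):
    """Return (mean, standard deviation) of the extension."""
    mu = C_MU * L * (l_K * w_eff / (D_H * D_W)) ** (1.0 / 3.0)
    sigma = (C_VAR * L * l_K) ** 0.5
    return mu, sigma


def infer_parameters(L, mu, sigma, D_H, D_W):
    """Invert Eqs. (2): Kuhn length from the variance, then w_eff from the mean."""
    l_K = sigma ** 2 / (C_VAR * L)
    w_eff = (mu / (C_MU * L)) ** 3 * D_H * D_W / l_K
    return l_K, w_eff


if __name__ == "__main__":
    # Hypothetical contour length; l_K and w_eff are the 5xTBE entries of Table I.
    L = 60000.0
    mu, sigma = predict_extension(L, l_K=101.2, w_eff=4.6, D_H=300.0, D_W=302.0)
    print("predicted mean and std [nm]:", mu, sigma)
    print("recovered (l_K, w_eff) [nm]:", infer_parameters(L, mu, sigma, 300.0, 302.0))
```

The variance fixes $\ell_K$ on its own because, in this regime, it does not depend on the channel size; the mean then fixes $w_{\rm eff}$.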

Calculation of parameters. We now discuss how the
parameters $L$, $\ell_K$ and $w_{\rm eff}$ are commonly estimated for
DNA, and what the main uncertainties are. We first
discuss bare DNA, before considering the effect of staining
with fluorescent dye (YOYO-1).

                      0.05×TBE   0.5×TBE   2×TBE   5×TBE
$I_s$ [mM]            3.81       24.9      78.4    178.0
$\ell_K$ [nm]         154.4      116.5     105.9   101.2
$w_{\rm eff}$ [nm]    26         10        6.2     4.6

TABLE I: Numerical values for the ionic strength $I_s$ of the buffer,
the Kuhn length $\ell_K$, and the effective width $w_{\rm eff}$ (see text).

The contour length of bare DNA is 0.34 nm per base 
pair, with an uncertainty of about 0.01 nm, or 3% [30]. 

DNA is commonly modeled as a worm-like chain, for
which the Kuhn length is twice the persistence length,
$\ell_K = 2\ell_P$ [9]. The persistence length has been measured
by a number of different techniques [16, 17, 19, 31], yielding
$\ell_P \approx 45$-$50$ nm at high ionic strength ($I_s > 10$ mM),
and increasing at lower $I_s$. Following Refs. [3, 25, 26, 32]
we use the empirical formula suggested in Ref. [23],
$\ell_P \approx 46\,{\rm nm} + 1.92\,{\rm M}^{1/2}\, I_s^{-1/2}\,{\rm nm}$.
While this dependence of $\ell_P$
upon $I_s$ has a theoretical basis, the prefactors are known
only from an empirical fit with an uncertainty of about
10% (Fig. 3 in Ref. [23]). The resulting values of $\ell_K$ are
given in Table I; the calculation of $I_s$ is described in the
Supplemental Material [33].
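As a quick check of the persistence-length formula, the following Python sketch evaluates $\ell_K = 2\ell_P$ for the four ionic strengths of Table I. It assumes the formula reads $\ell_P \approx 46\,{\rm nm} + 1.92\,{\rm nm}/\sqrt{I_s/{\rm M}}$ with the rounded coefficients quoted in the text; the function name is illustrative.

```python
import math

# Sketch: Kuhn length from ionic strength, l_K = 2 * l_P, with
# l_P ~ 46 nm + 1.92 nm / sqrt(I_s / M). The coefficients are the
# rounded values quoted in the text (an assumption of this sketch).


def kuhn_length_nm(ionic_strength_mM):
    """Kuhn length in nm for an ionic strength given in mM."""
    ionic_strength_M = ionic_strength_mM / 1000.0
    persistence_nm = 46.0 + 1.92 / math.sqrt(ionic_strength_M)
    return 2.0 * persistence_nm


# Ionic strength [mM] -> Kuhn length [nm] as tabulated in Table I.
TABLE_I = {3.81: 154.4, 24.9: 116.5, 78.4: 105.9, 178.0: 101.2}

if __name__ == "__main__":
    for i_mM, tabulated in TABLE_I.items():
        print(i_mM, "mM:", round(kuhn_length_nm(i_mM), 1), "nm (Table I:", tabulated, "nm)")
```

With the rounded coefficients the result agrees with Table I to within a fraction of a nanometre, which is far below the roughly 10% uncertainty of the empirical fit.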

Stigter [24] computed the excluded volume between
two long, strongly charged cylinders in NaCl solution,
and applied this calculation to DNA to obtain an
estimate of $w_{\rm eff}$. Linear interpolation on a doubly
logarithmic scale of the effective widths given in Table 1 of
Ref. [24] yields the values tabulated in Table I. There are
many sources of uncertainty when applying this theory
to our system. Stigter’s calculation [24] for $w_{\rm eff}$ assumes
that the Kuhn length segments can be approximated by
infinitely long cylinders with an intrinsic width of 1.2
nm, and that the effective line charge of DNA is given by
$0.73\,e^-$ per phosphate group. The approximation of
infinite cylinders is problematic when the Kuhn length and
the effective width are of the same order, i.e. for low ionic
strengths. According to Stigter, the value 1.2 nm has an
uncertainty of about 20%, leading to an uncertainty for
the effective width of 5-10% [34], with a larger effect at
large ionic strengths. The effective line charge estimate
is based on measurements in NaCl solutions, which do
not generalise to other ions [35].
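Linear interpolation on a doubly logarithmic scale means interpolating $\log w_{\rm eff}$ linearly in $\log I_s$. The sketch below illustrates the procedure in Python. Note the assumption: the entries of Table 1 in Ref. [24] are not reproduced here, so the four rows of Table I are used as stand-in nodes purely for illustration, and the function name is hypothetical.

```python
import math

# Sketch: log-log linear interpolation of the effective width.
# Stand-in nodes: the four (I_s, w_eff) pairs of Table I. The original
# calculation interpolates the table of Ref. [24], which is not reproduced here.
IONIC_STRENGTH_MM = [3.81, 24.9, 78.4, 178.0]
W_EFF_NM = [26.0, 10.0, 6.2, 4.6]


def w_eff_loglog(ionic_strength_mM):
    """Interpolate log(w_eff) linearly in log(I_s) between neighbouring nodes."""
    xs = [math.log(v) for v in IONIC_STRENGTH_MM]
    ys = [math.log(v) for v in W_EFF_NM]
    x = math.log(ionic_strength_mM)
    if not xs[0] <= x <= xs[-1]:
        raise ValueError("ionic strength outside the tabulated range")
    for k in range(len(xs) - 1):
        if xs[k] <= x <= xs[k + 1]:
            t = (x - xs[k]) / (xs[k + 1] - xs[k])
            return math.exp(ys[k] + t * (ys[k + 1] - ys[k]))


if __name__ == "__main__":
    print("w_eff at 50 mM (illustrative):", w_eff_loglog(50.0), "nm")
```

Between two nodes this is equivalent to assuming a local power law $w_{\rm eff} \propto I_s^{-a}$, with the exponent $a$ set by the neighbouring table entries.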

The DNA contour length is expected to increase in
proportion to the amount of dye bound. We assume
that dye intercalation extends the bare contour length
of the DNA molecule by 0.44 nm per dye molecule [3].
Estimates of this number range from about 0.4 nm to
0.5 nm, with large uncertainties in the individual
estimates [36, 37]; the uncertainty is at least 10%. At a
dye loading of 1 molecule per 10 base pairs, this
corresponds to an uncertainty in the contour length of ≈ 2%.
There is no consensus regarding how intercalating dye
molecules affect the parameters $\ell_K$ and $w_{\rm eff}$. Ref. [38]
finds that the Kuhn length decreases with increasing dye
load, whereas Ref. [37] finds no dependence. Since the
dye molecules are positively charged, the effective width
might decrease with dye load, but the magnitude of this
effect is not known. Lacking a better estimate, it is
commonly assumed that YOYO-1 binding does not affect these
parameters [3, 26, 39]. In summary, while the contour
length of DNA is known to a rather high accuracy, there
is substantial uncertainty regarding the parameters $\ell_K$
and $w_{\rm eff}$.
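The contour-length bookkeeping above amounts to a one-line formula, sketched here in Python. The numbers 0.34 nm per base pair and 0.44 nm per dye molecule are those quoted in the text, and 48502 base pairs is the length of the λ-DNA used in the experiments; the function name is illustrative.

```python
# Sketch: contour length of stained DNA,
# L = N_bp * (0.34 nm + dye_per_bp * 0.44 nm).
RISE_PER_BP_NM = 0.34         # bare DNA, nm per base pair
EXTENSION_PER_DYE_NM = 0.44   # assumed extension per intercalated dye molecule


def contour_length_nm(n_base_pairs, dye_per_bp):
    """Contour length in nm for a given dye load (dye molecules per base pair)."""
    return n_base_pairs * (RISE_PER_BP_NM + dye_per_bp * EXTENSION_PER_DYE_NM)


if __name__ == "__main__":
    bare = contour_length_nm(48502, 0.0)
    stained = contour_length_nm(48502, 1.0 / 10.0)   # 1 dye per 10 base pairs
    print("bare lambda-DNA    [nm]:", bare)
    print("stained lambda-DNA [nm]:", stained)
    print("relative increase      :", stained / bare - 1.0)
```

At one dye molecule per 10 base pairs the dye contributes 0.044 nm per base pair on top of 0.34 nm, so the contour length grows by about 13% relative to bare DNA.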

Experimental method. The experimental data are
obtained by measuring the extension of single DNA molecules
in nanochannels under different conditions (Fig. 1a to c).
We use linear λ- and T4GT7-DNA (T4-DNA for short)
with definite contour lengths of 48502 and 165647 base
pairs, respectively (this ensures that $L$ is large enough
in our experiments: it exceeds the minimal contour length
required by the theory by an order of magnitude [21]).
The molecules are stained with
YOYO dye and suspended inside a channel in a TBE
(Tris-Borate-EDTA) buffer.

The first experiment is a re-analysis of data presented
in Ref. [32]. In this experiment (Fig. 1a), λ-DNA is
inserted into a nanochannel of height $D_H$ = 150 nm and
width $D_W$ = 108 nm. We discuss the uncertainty in the
channel dimensions in the Supplemental Material [33].
DNA extensions are measured at different buffer
conditions (0.05×, 0.5×, 2× and 5×TBE), and at different dye
loads. To estimate the dye load of a molecule we assume
that it is proportional to the luminosity, and that the
largest observed luminosity corresponds to full
intercalation (one dye molecule per four base pairs). In this way
we obtain an estimate for the amount of dye bound to
the molecule by linear interpolation.
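The dye-load estimate just described is a linear map from relative luminosity to dye molecules per base pair. A minimal Python sketch follows; it assumes, as stated in the text, that the largest observed luminosity corresponds to one dye molecule per four base pairs, and additionally that zero luminosity corresponds to zero dye. The function name is illustrative.

```python
# Sketch: dye load from luminosity by linear interpolation.
# Assumption from the text: the largest observed luminosity corresponds to
# full intercalation, i.e. one dye molecule per four base pairs.
# Additional assumption of this sketch: zero luminosity means zero dye.
FULL_INTERCALATION_DYE_PER_BP = 0.25


def dye_per_base_pair(luminosity, max_luminosity):
    """Estimated number of dye molecules per base pair."""
    if not 0.0 <= luminosity <= max_luminosity:
        raise ValueError("luminosity must lie between 0 and the observed maximum")
    return FULL_INTERCALATION_DYE_PER_BP * luminosity / max_luminosity


if __name__ == "__main__":
    # A molecule at 40% of the maximal luminosity carries about 1 dye per 10 bp.
    print(dye_per_base_pair(0.4, 1.0))
```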

In experiment 2 (Fig. 1b), T4-DNA is inserted into a
nanofunnel, with fixed height $D_H$ = 120 nm and gradually
changing width from $D_W$ = 92 nm to $D_W$ = 815 nm
over a length of 500 µm. These experiments are at two
different buffer concentrations (0.05× and 2×TBE).

In experiment 3 (Fig. 1c), T4-DNA is inserted into
a channel with $D_W$ = 302 nm, $D_H$ = 300 nm. The
buffer concentration is varied (0.05×, 0.5×, 2×, and
5×TBE). In experiments 2 and 3, the average dye load
at 0.05×TBE is approximately 1 dye molecule per 10
base pairs. Assuming that the dye load is proportional
to luminosity we estimate the dye load in experiment 2
at 2×TBE to be 1 dye molecule per 45 base pairs, and 1
per 12, 16, 28 base pairs at 0.5×, 2×, and 5×TBE in
experiment 3.

For each molecule 200 frames are recorded. Fig. 1d
shows an example of a fluorescence-intensity trace
(‘kymograph’) obtained in this way. Each row in the
kymograph shows the fluorescence intensity in a given frame
averaged over the channel cross section. Bright regions
correspond to high intensity indicating where the DNA
molecule is located. The extension of the molecule along
the channel is simply the width of the bright region.

For a given set of parameters we estimate the mean
and standard deviation of the extension by a linear mixed
model that takes into account the fact that the measured
extensions are correlated in time. Details concerning the
experimental method and the data analysis are given in
the Supplemental Material [33].

Results. Our results are shown in Fig. 2. We plot two
theoretical curves. The solid curve uses the actual
channel size $D_H \times D_W$. The dashed curve compensates for the
repulsive interaction with the negatively charged walls
[1], by using an ‘effective channel size’ $(D_H - \delta) \times (D_W - \delta)$.
We take $\delta = w_{\rm eff}$, but it is not known how accurate this
estimate is. Since the standard deviation is independent
of channel size in the extended de Gennes regime, the
compensation does not affect this comparison.

The results of experiment 1 are shown in panels a and
b. At low relative luminosity (small dye-to-basepair
ratio) the average extension is well described by Eq. (2a).
For the standard deviation there are larger differences
between experiment and Eq. (2b). Possible reasons are
discussed below.

We turn now to the effect of increasing the
dye-to-basepair ratio of the DNA. The theoretical lines are
calculated under the assumption that each dye molecule
increases the contour length by 0.44 nm but leaves the
Kuhn length and the effective width unchanged. This
yields estimates of the mean and standard deviation that
overestimate the observables at high ionic strengths and
high dye loads. A simple explanation would be that
the persistence length decreases slightly with increasing
dye load, in agreement with Ref. [38] though not with
Ref. [37]. Note that since experiments 2 and 3 were
performed at low dye-to-basepair ratios, such a decrease
would not significantly influence the interpretations of
these experiments.

The results of experiment 2 are shown in panels c, d.
Again, the experimental results are in qualitative
agreement with the theoretical predictions. We see that the
average extension agrees well with the theoretical
prediction. However, the model predictions underestimate the
standard deviation for the larger ionic strength, and for
the largest channel at the lower ionic strength.

It is important to note that for experiment 2 we do
not expect perfect agreement with Eqs. (2), since the
condition $D_H \gg \ell_K$ is not satisfied, or only weakly
satisfied. However, as long as $D_W \gg \ell_K$, the violation of the
condition for $D_H$ only affects the prefactors but not the
power of $D_W$ in Eqs. (2). This follows from the fact that
a mapping to a one-dimensional model is possible also
when $D_H \sim \ell_K$ [21]. In accordance with this prediction,
the data points satisfying $D_W \gg \ell_K$ in panel c obey the
scaling $\mu \propto D_W^{-1/3}$ of Eq. (2a). Similarly the data points
at 2×TBE which satisfy $D_W \gg \ell_K$ show a variance that
is approximately independent of $D_W$, in agreement with
Eq. (2b), Fig. 2d. For the two rightmost data points at
0.05×TBE, the condition $D_W^2 \ll D_H \ell_K^2 / w_{\rm eff}$ is violated.
At this point the variance is expected to start increasing
as $D_W$ increases further [21], in perfect agreement with
what is observed.

FIG. 2: Experimental results for 0.05×TBE (red ○), 0.5×TBE (green □), 2×TBE (blue ◇), and 5×TBE (black △). a, b
Experiment 1. Mean and standard deviation of the extension of λ-DNA in a narrow nanochannel, as a function of relative
luminosity. Theory [Eq. (2)], solid lines. The rigorous bounds on the prefactor in Eq. (2b) are indicated as a shaded region for
0.05×TBE; they are of the same order for the other cases. The corresponding uncertainty for the extension is much smaller and
not shown. The dashed line shows theory corrected for wall repulsion (see text). c, d Experiment 2. Same, but for T4-DNA
in a nanofunnel with varying width $D_W$. Note that panel c is a log-log plot. e, f Experiment 3. Same, but for T4-DNA in a
wider square nanochannel, as a function of buffer concentration (x × TBE). Error bars correspond to 95% confidence intervals
from the statistical analysis; the experimental uncertainty is not taken into account.

For square channels simulations [11, 15, 22] show that
the mean extension increases more rapidly with
decreasing $D = D_H = D_W$ than Eq. (2a) predicts, when $D$ is
comparable to $\ell_K$.
The reason is that there is a tendency for the DNA
molecule to align with the wall, and that the presence
of the walls makes it more difficult for the molecule
to change direction in the channel, forming a ‘hairpin’
[15, 27]. This can explain why the average extension
appears to increase slightly faster with decreasing width
than Eq. (2a) predicts, for the leftmost points in panel c.
Such a trend has also been observed in previous
measurements in rectangular channels [27, 40].

Now consider the standard deviation. The alignment
and correlation effects mentioned above cause $\sigma$ to be
larger than Eq. (2b) predicts [27]. But when $\mu$ approaches the
maximal extension $L$, fluctuations are suppressed [15].
These two effects could explain why, in experiments 1
and 2, $\sigma$ is larger than predicted by theory at high ionic
strengths but smaller for low ionic strengths and small
channel sizes. It must also be noted that the standard
deviation is difficult to estimate precisely, as it is not very
much larger than the pixel size in the image (159 nm),
and may depend on the assumptions entering into the
statistical analysis (see Supplemental Material [33]). An
additional source of uncertainty specific to experiment 2
is that $D_W$ changes over the span of the molecule. For
the most extended condition ($\mu$ = 37 µm), the channel
width at either end of the molecule differs by
approximately 25 nm from the stated width, measured at the
center of the molecule.

The results of experiment 3 are shown in panels e, f.
Here Eq. (1) is well satisfied. Simulations indicate [15, 22]
that the alignment effects discussed above have little
influence on $\mu$ in square channels with $D \gtrsim 3\ell_K$. Equally
sensitive simulation results for $\sigma$ have not been published,
but simulations of the alignment effect [15, 27] indicate
that Eq. (2b) underestimates $\sigma$ by approximately 10%.
We find that for the three largest ionic strengths,
measurements are in excellent agreement with theory. The
mean extension (panel e) agrees very well with the
theoretical prediction of Eq. (2a), and Eq. (2b) underestimates
the standard deviation (panel f) by about 10%, just as
the measurements of alignment effects would suggest.

Intriguingly, the relation between measurements and
predictions is different at 0.05×TBE than at high ionic
strengths. Both mean and standard deviation are smaller
than expected. This is particularly surprising
considering that alignment effects should be even stronger at
low ionic strength (where $\ell_K$ is larger). The discrepancy
might indicate that the standard model does not describe
the physical parameters well at such low ionic strengths,
possibly because the high relative concentration of BME
significantly changes the buffer conditions, lowering the
pH from ≈ 8.5 to ≈ 7.5. But we note that it may be
hard to ensure uniform dye coverage under these
conditions [32]. Also, BME is consumed as the experiment
proceeds. This may change the ionic strength at small
buffer concentrations, an effect our calculations do not
include.

Conclusions. We have compared measurements of the
extension of confined DNA to asymptotically exact
predictions. First, we find very good agreement between
experiments and theoretical predictions at high ionic
strengths. A possible cause for deviations at low ionic
strengths is that common estimates for $w_{\rm eff}$ and $\ell_K$ of
stained DNA are too imprecise. Second, by measuring
longer time series and more molecules in wider
channels we expect to be able to precisely determine how $\ell_K$
and $w_{\rm eff}$ depend on the ionic strength of the solution. It
may even be possible to determine $\ell_K$ and $w_{\rm eff}$ for single
molecules. We note that $w_{\rm eff}$ is an effective parameter
defined in terms of the excluded volume per Kuhn length;
even for rigid rods it is identical to the actual width
only when $\ell_K \gg w_{\rm eff}$. Finally, the approach described
here could be used to investigate DNA-wall interactions,
a question about which little is known, and that is hard
to describe theoretically.

EXPERIMENTAL PROCEDURE

Channel manufacture. The channels were
manufactured as described in Ref. [S1]. The nanochannels used
in experiment 1 have a depth of $D_H$ = 150 nm. The
channel cross-section is not perfectly rectangular, but
rather trapezoidal, with a width at the top (bottom) of
the channel of 130 nm (87 nm). We assume that these
channels can be approximated by rectangular channels
with a width $D_W$ = 108 nm, which is the arithmetic
mean of the measurements at the top and bottom of the
channel. The funnels used in experiment 2 have a depth
of approximately $D_H$ = 120 nm, and $D_W$ increases from
92 nm (top: 111 nm, bottom: 73 nm) at the narrow end
to 815 nm (top: 830 nm, bottom: 800 nm) at the wide
end, over a length of 500 µm. Finally, the channels used
in experiment 3 have a depth $D_H$ = 300 nm, and a width
$D_W$ = 302 nm (top: 330 nm, bottom: 275 nm). The
dimensions are measured before the channel is closed by
the lid. Since the bonding process changes the depth of
the channel, our values for the depth of the channels have
a relatively large uncertainty of about 10 nm.

Buffer preparation. The buffers were obtained by
diluting 10×TBE tablets from Medicago to the desired
concentration. 1×TBE contains 0.089 M Tris, 0.089 M
Borate and 0.0020 M EDTA. Right before the experiments,
3% BME (3 µL of BME to 97 µL of sample solution) was
added to prevent photo-nicking of the DNA. In addition,
the micro- and nanochannels in the chip were
flushed with buffer of the same concentration, also containing
3% BME, to keep the same environment in the
chip as in the sample solution.

Dye intercalation. Since intercalation of YOYO-dye
molecules extends the contour length of the DNA it is
important to estimate the dye load as accurately as possible.
In experiment 1, the dye load was not equilibrated
between molecules, resulting in heterogeneous staining [S2].
In experiments 2 and 3 the aim was instead to achieve
homogeneous staining. To this end we first mixed the
sample in high ionic strength (5×TBE), let it rest for
at least 20 min, and then diluted to the desired ionic
strength [S2].

DNA insertion. The nanofluidic chips consist of four
loading wells connected two and two by
microchannels. The microchannels are in turn connected by many
nanochannels. Sample solution was inserted into one of
the loading wells. Then the DNA was carried to the
nanochannels by pressure-driven flow (N2 gas for T4
experiments and air for lambda experiments). The DNA
was forced into the nanochannels by applying pressure
over two wells that are connected by a microchannel.
Once the DNA was inserted in the channel, the pressure was switched
off. Before imaging, the DNA was allowed to relax for
about 30 seconds to reach its equilibrium extension.

Video recording. Videos were recorded using a
Photometrics Evolve™ EMCCD camera. The image pixel
size is 159.2 nm. For all measurements, the exposure
time was 100 ms per photograph, but the delay between
frames differed between experiments. For experiment 1,
the delay between frames was 84 ms, yielding a frame
interval of 184 ms/frame. For experiments 2-3 with T4-DNA,
the correlation time is significantly longer, so to minimise
problems with photobleaching and nicking, the frame
rate was decreased. For experiment 2, the frame interval
was 0.5 s/frame for the measurements at 0.05×TBE,
and 2 s/frame for the ones at 2×TBE. For experiment
3, the frame interval was 1 s/frame for all measurements.

Number of molecules. In experiment 1, 2388 molecules
were analysed in total. Since the relative intensities are
distributed unevenly, so are the number of molecules in
each bin. This leads to very different error estimates for
different bins in the estimation of the mean and standard
deviation. We excluded bins with two molecules or fewer from
the analysis. A further three molecules were excluded because
the extension could not be successfully extracted from all
frames. In experiment 2, 9-10 molecules were analysed
per data point under 0.05×TBE, and 7 molecules under
2×TBE. In experiment 3, 30 to 35 molecules were
analysed per measured data point. One outlier at 0.5×TBE
was removed, as its extension changed abruptly halfway
through the data series.

DATA ANALYSIS

Extraction of DNA extension from kymographs.
Consider a raw kymograph of the form provided in the main
text (Fig. 1d). Dark regions in this image correspond
to background; the bright regions correspond to
fluorescence from DNA. The fluorescence intensities measured
for both regions are subject to noise. This makes it
difficult to reliably identify the end points of the bright
regions in an automated fashion. In previous studies this
was achieved by fitting the difference of two sigmoidal
functions [S3] to each time frame. This method was used
for analysing experiment 1 [S2]. In experiments 2-3 we
used a computationally faster method that yields very
similar results for the mean, and agrees to within about
5% for the standard deviation of the extension. The new
method for detecting end positions relies on the
assumption that the fluorescence intensities from DNA and from
the background assume consistent and easily
differentiated values, and proceeds as follows. First, each frame in
the raw kymograph is smoothed using a moving average
(we use an averaging window that is five pixels wide).
Second, each frame is segmented into binary low-
and high-intensity regions using Otsu’s method [S4, S5].
Third, we close gaps between high-intensity regions
that are equal to or shorter than five pixels. Fourth, the
largest connected high-intensity component is found, and
its edges are identified. The distance between these edges
is the extension of the DNA molecule.
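The four steps above can be sketched for a single kymograph row as follows. This is an illustrative NumPy re-implementation under stated assumptions, not the code used for the analysis: the histogram-based Otsu threshold, the function names, and the synthetic test frame are all choices made for this sketch. The extension is returned in pixels; multiplying by the pixel size (159.2 nm) converts it to a length.

```python
import numpy as np


def otsu_threshold(values, bins=256):
    """Histogram-based Otsu threshold: maximise the between-class variance."""
    hist, edges = np.histogram(values, bins=bins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    n_low = np.cumsum(hist)                 # pixels in bins 0..k
    n_high = n_low[-1] - n_low              # pixels in bins k+1..end
    sum_low = np.cumsum(hist * centers)
    valid = (n_low > 0) & (n_high > 0)
    if not valid.any():                     # degenerate frame: a single occupied bin
        return float(edges[-1])
    mean_low = sum_low[valid] / n_low[valid]
    mean_high = (sum_low[-1] - sum_low[valid]) / n_high[valid]
    between = n_low[valid] * n_high[valid] * (mean_low - mean_high) ** 2
    # Use the upper edge of the best bin as the threshold.
    return float(edges[1:][valid][np.argmax(between)])


def frame_extension(frame, window=5, max_gap=5):
    """Extension (in pixels) of the largest bright region in one kymograph row."""
    # Step 1: moving average over `window` pixels.
    smooth = np.convolve(frame, np.ones(window) / window, mode="same")
    # Step 2: binary segmentation with Otsu's threshold.
    high = smooth > otsu_threshold(smooth)
    # Locate runs of high-intensity pixels as [start, end) index pairs.
    padded = np.concatenate(([False], high, [False])).astype(int)
    steps = np.diff(padded)
    starts = np.flatnonzero(steps == 1)
    ends = np.flatnonzero(steps == -1)
    if starts.size == 0:
        return 0
    # Step 3: merge runs separated by gaps of at most `max_gap` pixels.
    merged = [[starts[0], ends[0]]]
    for start, end in zip(starts[1:], ends[1:]):
        if start - merged[-1][1] <= max_gap:
            merged[-1][1] = end
        else:
            merged.append([start, end])
    # Step 4: the largest connected component gives the extension.
    return int(max(end - start for start, end in merged))


if __name__ == "__main__":
    # Synthetic, noise-free row: background 10, a 40-pixel bright region at 100.
    row = np.full(120, 10.0)
    row[40:80] = 100.0
    pixels = frame_extension(row)
    print("extension:", pixels, "pixels =", pixels * 159.2, "nm")
```

Applying `frame_extension` to each of the 200 rows of a kymograph yields the time series of extensions that enters the statistical analysis.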

Statistical analysis. The experimental data consist of
repeated measurements of the extension of individual
DNA molecules at given values of parameters and are
modeled using a random-coefficient model, a variation of
a linear-regression model adapted to correlated data [S6].
Let $y_{ij}$ denote the extension of molecule $i$ ($i = 1, \ldots, n$)
at time $t_j$ ($j = 1, \ldots, 200$). The extension is modeled as
a linear function of time as $y_{ij} = \mu + \beta t_j + u_i + e_{ij}$, where
$\mu$ and $\beta$ denote the overall mean value and slope. To
account for the within-molecule correlation of data points
the error term is split into two uncorrelated components,
a molecule-specific correction to the mean, $u_i$, and an
error term $e_{ij}$ that measures the deviation of a data point
from the molecule-specific regression line, $\mu + \beta t_j + u_i$.
Both $u_i$ and $e_{ij}$ are assumed to be independently
normally distributed with mean zero and variance $\tau^2$ and $\sigma^2$,
respectively. The model describes the mean extension
of molecules at a particular set of external parameters,
and the linear time dependence makes it possible to include small
unmeasured changes of external conditions, for instance
drift into wider or narrower channel regions, or change
of buffer concentration. The random intercept adjusts
for the initial conditions at the start of measurement of
each time series. A random intercept model is adequate
as each set of molecules can be viewed as a random
sample representative for all molecules at this particular set
of external parameters. On average, the covariance of
two arbitrary data points within molecules is equal to
the variance of intercepts between molecules. The
ratio of variances, $\tau^2/(\tau^2 + \sigma^2)$, is both a measure for the
proportion of variation that is explained by differences
between molecules, and the correlation of data points
within a typical time series. By isolating the
molecule-specific random variation due to unknown fluctuations of
external parameters we are able to obtain an improved
value for the residual variance of the extension ($\sigma^2$) that
can be compared with the theory. Although the time
trend was weak in most cases, we evaluated the mean
value $\mu$ at $t = 100$, i.e. we replaced $t_j$ with $(t_j - 100)$ in
the above model. Results are given in terms of point
estimates together with a 95% confidence interval (95% CI).
The statistical analyses were performed by using SAS
(version 9.4; SAS Institute, Cary, NC).
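To make the structure of the random-intercept model concrete, the following Python sketch simulates data from $y_{ij} = \mu + \beta t_j + u_i + e_{ij}$ and recovers the parameters with simple method-of-moments (ANOVA-type) estimators. This is a simpler stand-in for the likelihood-based fit performed in SAS, not a reproduction of it. The simulated parameter values are hypothetical, the residuals are taken to be independent, and time is centred at its mean rather than at $t = 100$.

```python
import numpy as np


def fit_random_intercept(y, t):
    """Moment estimators for y_ij = mu + beta*(t_j - mean(t)) + u_i + e_ij.

    y has shape (n_molecules, n_frames); t has shape (n_frames,).
    """
    n, m = y.shape
    tc = t - t.mean()                               # centred time
    molecule_means = y.mean(axis=1)
    within = y - molecule_means[:, None]            # removes mu and u_i
    beta = (within @ tc).sum() / (n * (tc @ tc))    # pooled within-molecule slope
    residual = within - beta * tc
    sigma2 = (residual ** 2).sum() / (n * (m - 1) - 1)
    # Var(molecule mean) = tau^2 + sigma^2 / m for independent residuals.
    tau2 = max(molecule_means.var(ddof=1) - sigma2 / m, 0.0)
    return {
        "mu": molecule_means.mean(),
        "beta": beta,
        "tau2": tau2,
        "sigma2": sigma2,
        "icc": tau2 / (tau2 + sigma2),              # tau^2 / (tau^2 + sigma^2)
    }


def simulate(n=100, m=200, mu=9000.0, beta=0.5, tau=300.0, sigma=500.0, seed=0):
    """Simulate extensions (hypothetical values, in nm) for n molecules, m frames."""
    rng = np.random.default_rng(seed)
    t = np.arange(1, m + 1, dtype=float)
    u = rng.normal(0.0, tau, size=n)                # molecule-specific intercepts
    e = rng.normal(0.0, sigma, size=(n, m))         # frame-to-frame residuals
    y = mu + beta * (t - t.mean()) + u[:, None] + e
    return y, t


if __name__ == "__main__":
    y, t = simulate()
    for name, value in fit_random_intercept(y, t).items():
        print(name, value)
```

The estimate of $\sigma^2$ is the quantity that would be compared with Eq. (2b), while $\tau^2$ absorbs the molecule-to-molecule variation.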


CALCULATION OF IONIC STRENGTH 

The parameters $w_{\rm eff}$ and $\ell_K$ of the DNA molecule depend
on the ionic strength of the surrounding solution. The
ionic strength is defined as

$$ I_s = \frac{1}{2} \sum_i c_i z_i^2 . \tag{S1} $$

Here, the sum runs over all ions in the solution, $c_i$ is
the concentration of ion species $i$, and $z_i$ its valence. To
compute the ionic strength, we must calculate the
equilibrium concentrations $c_i$ of all ions. Since there is some
confusion in the literature (discussed below) about the
ionic strength of TBE buffer, we document our
calculation in some detail.

We follow the method outlined in Ref. [S7, Section 12-2].
Denote by $C[{\rm X}]$ the total amount of all species of a
substance X, in neutral and ionic form. At $N\times$TBE, the
solution contains $C[{\rm T}] = N \times 0.089$ M Tris, $C[{\rm Bo}] =
N \times 0.089$ M Borate, $C[{\rm E}] = N \times 0.0020$ M EDTA,
and $C[\beta] = 0.429$ M BME. The dissociation and
association constants that determine the chemical
equilibrium are Tris: p$K_b$ = 5.94; Borate: p$K_a$ = 9.24; EDTA:
p$K_a$ = {1.99, 2.67, 6.16, 10.26}; BME: p$K_a$ = 9.6. At
the resulting pH-value of approximately 8.5, most of
the EDTA is triply ionised, and we can safely ignore
the minute concentrations of neutral and singly ionised
EDTA. We denote the molar concentration of a species by
[X], and its activity coefficient by $\gamma_{\rm X}$. The activity
coefficients are assumed to be given by the Davies equation, as
stated in Ref. [S7] (the empirical prefactors of this
equation differ between sources. For our ionic strength
calculations, the choice of formulation changes the
result by two percent at most). The system
of equations that must be solved is

\begin{align}
\frac{[{\rm TH^+}]\gamma_{\rm TH^+}\,[{\rm OH^-}]\gamma_{\rm OH^-}}{[{\rm T}]\gamma_{\rm T}} &= 10^{-5.94} \tag{S2} \\
\frac{[{\rm Bo^-}]\gamma_{\rm Bo^-}\,[{\rm H^+}]\gamma_{\rm H^+}}{[{\rm HBo}]\gamma_{\rm HBo}} &= 10^{-9.24} \tag{S3} \\
\frac{[{\rm HE^{3-}}]\gamma_{\rm HE^{3-}}\,[{\rm H^+}]\gamma_{\rm H^+}}{[{\rm H_2E^{2-}}]\gamma_{\rm H_2E^{2-}}} &= 10^{-6.16} \tag{S4} \\
\frac{[{\rm E^{4-}}]\gamma_{\rm E^{4-}}\,[{\rm H^+}]\gamma_{\rm H^+}}{[{\rm HE^{3-}}]\gamma_{\rm HE^{3-}}} &= 10^{-10.26} \tag{S5} \\
\frac{[\beta^-]\gamma_{\beta^-}\,[{\rm H^+}]\gamma_{\rm H^+}}{[{\rm H}\beta]\gamma_{{\rm H}\beta}} &= 10^{-9.6} \tag{S6} \\
[{\rm H^+}]\gamma_{\rm H^+}\,[{\rm OH^-}]\gamma_{\rm OH^-} &= 10^{-14.0} \tag{S7} \\
[{\rm T}] + [{\rm TH^+}] &= C[{\rm T}] \tag{S8} \\
[{\rm HBo}] + [{\rm Bo^-}] &= C[{\rm Bo}] \tag{S9} \\
[{\rm H_2E^{2-}}] + [{\rm HE^{3-}}] + [{\rm E^{4-}}] &= C[{\rm E}] \tag{S10} \\
[{\rm H}\beta] + [\beta^-] &= C[\beta] \tag{S11} \\
[{\rm TH^+}] + [{\rm H^+}] &= [{\rm OH^-}] + [{\rm Bo^-}] + 2[{\rm H_2E^{2-}}] + 3[{\rm HE^{3-}}] + 4[{\rm E^{4-}}] + [\beta^-] \tag{S12}
\end{align}

Eqs. (S2)-(S6) are the equilibrium conditions from the
law of mass action, Eqs. (S7)-(S11) ensure that the total
concentration of a substance is conserved, and Eq. (S12)
ensures charge neutrality. The activity coefficients $\gamma_{\rm X}$
depend on the ionic strength $I_s$ according to the Davies
equation [S7],

$$ \log_{10} \gamma_{\rm X} = -0.51\, z_{\rm X}^2 \left( \frac{\sqrt{I_s}}{1 + \sqrt{I_s}} - 0.3\, I_s \right) . \tag{S13} $$

Here, $z_{\rm X}$ is the valence of species X, and $I_s$ is measured
in units of M. Finally, the ionic strength itself is given
according to Eq. (S1) by

$$ I_s = \frac{1}{2} \left( [{\rm TH^+}] + [{\rm H^+}] + [{\rm OH^-}] + [{\rm Bo^-}] + 4[{\rm H_2E^{2-}}] + 9[{\rm HE^{3-}}] + 16[{\rm E^{4-}}] + [\beta^-] \right) . \tag{S14} $$

We solve the system of equations (S2)-(S14) iteratively.
Starting from the initial guess $\gamma_{\rm X} = 1$ for all species,
Eqs. (S2)-(S12) are solved with the Mathematica routine
FindInstance. Plugging the resulting concentrations
into Eqs. (S13)-(S14) yields new values for $\gamma_{\rm X}$, which are
then used in Eqs. (S2)-(S12). This process was repeated
until the ionic strength converged.
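The iteration can be sketched in Python as follows. This is an illustrative re-implementation under stated assumptions, not the Mathematica calculation described above: for fixed activity coefficients the charge-neutrality condition is solved for [H+] by bisection instead of with FindInstance, the Davies prefactor is taken as 0.51, neutral species have activity coefficient 1 (Davies equation with zero valence), and all function and variable names are hypothetical. The equilibrium constants and total concentrations are those quoted in the text.

```python
import math

PKB_TRIS, PKA_BORATE, PKA_BME, PKW = 5.94, 9.24, 9.6, 14.0
PKA_EDTA_3, PKA_EDTA_4 = 6.16, 10.26
C_BME = 0.429  # M, total BME concentration quoted in the text
CHARGE = {"TH+": 1, "H+": 1, "OH-": -1, "Bo-": -1,
          "H2E2-": -2, "HE3-": -3, "E4-": -4, "b-": -1}


def davies_gamma(z, ionic_strength):
    """Activity coefficient of an ion of valence z (ionic strength in M)."""
    root = math.sqrt(ionic_strength)
    return 10.0 ** (-0.51 * z * z * (root / (1.0 + root) - 0.3 * ionic_strength))


def speciate(h, n_tbe, ionic_strength):
    """Ion concentrations (M) at free [H+] = h for fixed activity coefficients."""
    g1, g2, g3, g4 = (davies_gamma(z, ionic_strength) for z in (1, 2, 3, 4))
    oh = 10.0 ** -PKW / (h * g1 * g1)                      # water autoprotolysis
    r = 10.0 ** -PKB_TRIS / (oh * g1 * g1)                 # [TH+]/[T]
    th = n_tbe * 0.089 * r / (1.0 + r)
    r = 10.0 ** -PKA_BORATE / (h * g1 * g1)                # [Bo-]/[HBo]
    bo = n_tbe * 0.089 * r / (1.0 + r)
    r = 10.0 ** -PKA_BME / (h * g1 * g1)                   # [b-]/[Hb]
    bme = C_BME * r / (1.0 + r)
    r3 = 10.0 ** -PKA_EDTA_3 * g2 / (g3 * g1 * h)          # [HE3-]/[H2E2-]
    r4 = 10.0 ** -PKA_EDTA_4 * g3 / (g4 * g1 * h)          # [E4-]/[HE3-]
    h2e = n_tbe * 0.0020 / (1.0 + r3 + r3 * r4)
    return {"TH+": th, "H+": h, "OH-": oh, "Bo-": bo,
            "H2E2-": h2e, "HE3-": h2e * r3, "E4-": h2e * r3 * r4, "b-": bme}


def solve_p_h(n_tbe, ionic_strength):
    """Bisection on -log10[H+] until the net charge vanishes."""
    low, high = 0.0, 14.0
    for _ in range(100):
        mid = 0.5 * (low + high)
        conc = speciate(10.0 ** -mid, n_tbe, ionic_strength)
        if sum(CHARGE[k] * c for k, c in conc.items()) > 0.0:
            low = mid      # excess positive charge: too acidic
        else:
            high = mid
    return 0.5 * (low + high)


def ionic_strength(n_tbe, tol=1e-10, max_iter=200):
    """Fixed-point iteration; returns (I_s in M, -log10[H+])."""
    current = 0.0          # corresponds to the initial guess gamma = 1
    for _ in range(max_iter):
        p_h = solve_p_h(n_tbe, current)
        conc = speciate(10.0 ** -p_h, n_tbe, current)
        new = 0.5 * sum(CHARGE[k] ** 2 * c for k, c in conc.items())
        if abs(new - current) < tol:
            return new, p_h
        current = new
    raise RuntimeError("ionic strength did not converge")


if __name__ == "__main__":
    for n in (0.05, 0.5, 2.0, 5.0):
        i_s, p_h = ionic_strength(n)
        print(f"{n}xTBE: I_s = {1000.0 * i_s:.2f} mM, -log10[H+] = {p_h:.2f}")
```

Because the sketch uses a rounded Davies prefactor and reports $-\log_{10}$ of the H+ concentration rather than of its activity, small differences from the values in Table I are to be expected.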

We tested our calculations by comparing with Table 1
of Ref. [S8]. We reproduce the reported ionic strengths
for 0% BME and 0.5% BME to within one percent.